Practical Prediction of Inductive Generalization by the Method of Random Falsifiability
نویسنده
چکیده
The relationship between learner capacity and generalization, established by Vapnik’s uniform convergence framework, suggests a remarkably simple validation procedure: reject the inductive hypothesis if the learning algorithm can also store the training examples with random labels. We study the properties of this apparently coarse estimator, which can be considered as the simplest possible capacity control method. First, the method is theoretically derived, under reasonable assumptions, from the uniform convergence requisites for generalization. Then, exhaustive experiments are performed on synthetic and natural data. Finally, we characterize the kind of learning situations in which the validation method is appropriate. We also discuss some implications with respect to the theoretical limits of pure induction. In summary, we will show that the method of random falsifiability is a satisfactory qualitative generalization predictor for learning machines with controllable capacity that try to minimize the empirical error. It is especially useful in adverse situations in which the problem distribution and the properties of the learning machine (e.g. its VC dimension) are unknown and additional test examples are not available.
منابع مشابه
Islamic Economy and Its Relation with Economic
The re-evaluation of the scientific method at the 20th marked the inductive method as the scientific reasoning. In the contemporary science, the falsifiability plays an important role. Particularly, the science is considered to be a collection of proposition which describe the observational realities which can be falsify throughout the empirical experiments. Economics as a contemporary science ...
متن کاملDiscovering Functional Sites of Amino Acid Sequences Using Sorted Variable Generalization
This research develops a method for discovering functional sites of amino acid sequences using an Inductive Logic Programming (ILP) method with sorted variable generalization. Functional sites provide clues to building a knowledge base for prediction of protein functions from amino acid sequences. The proposed method generates hypotheses of functional sites directly from aligned amino acid sequ...
متن کاملSampling analysis of concrete structures for creep and shrinkage with correlated random material parameters
The latin hypercube sampling method, which represents the most efficient way to determine the statistics of the creep and shrinkage response of structures, has previously been developed and used under the assumption that the random parameters of the creep and shrinkage prediction model are mutually independent. In reality they are correlated. On the basis of existing data, this paper establishe...
متن کاملA Generalization of Initial Conditions in Benchmarking of Economic Time-Series by Additive and Proportional Denton Methods
The paper presents unified analytical solution for combining high-frequency and low-frequency economic time-series by additive and proportional Denton methods with parametrical dependence on the initial values of variable and indicator in evident form. This solution spans Denton’s original and Cholette’s advanced benchmarking initial conditions as the subcases. Computational complexity of the o...
متن کاملThe Porosity Prediction of One of Iran South Oil Field Carbonate Reservoirs Using Support Vector Regression
Porosity is considered as an important petrophysical parameter in characterizing reservoirs, calculating in-situ oil reserves, and production evaluation. Nowadays, using intelligent techniques has become a popular method for porosity estimation. Support vector machine (SVM) a new intelligent method with a great generalization potential of modeling non-linear relationships has been introduced fo...
متن کامل